arXiv:1507.07559v3 [hep-th] 18 Jan 2016 


DESY-15-126, LMU-ASC 47/15 


(MS)SM-like models on smooth Calabi-Yau manifolds from all three 

heterotic string theories 

Stefan Groot Nibbelink a Q Orestis Loukas a,b 0 Fabian Ruehle c {^] 

a Arnold Sommerfeld Center for Theoretical Physics, 
Ludwig-Maximilians-Universitat Miinchen, 80333 Miinchen, Germany 

b School of Electrical and Computer Engineering, 

National Technical University of Athens, Zografou Campus, GR-15780 Athens, Greece 

c Deutsches Elektronen-Synchrotron DESY, Notkestrasse 85, 22607 Hamburg, Germany 


Abstract 

We perform model searches on smooth Calabi-Yau compactihcations for both the supersymmetric 
EgxEg and SO(32) as well as for the non-supersymmetric SO(16)xSO(16) heterotic strings simulta¬ 
neously. We consider line bundle backgrounds on both favorable CICYs with relatively small /in and 
the Schoen manifold. Using Gram matrices we systematically analyze the combined consequences of 
the Bianchi identities and the tree-level Donaldson-Uhlenbeck-Yau equations inside the Kahler cone. 
In order to evaluate the model building potential of the three heterotic theories on the various geome¬ 
tries, we perform computer-aided scans. We have generated a large number of GUT-like models (up to 
over a few hundred thousand on the various geometries for the three heterotic theories) which become 
(MS)SM-like upon using a freely acting Wilson line. For all three heterotic theories we present tables 
and figures summarizing the potentially phenomenologically interesting models which were obtained 
during our model scans. 
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1 Introduction 


There exist three heterotic string theories in ten dimensions that have modular invariant parti¬ 
tion functions and are tachyon-free: the supersymmetric EsxEs and SO(32) theories and the non- 
supersymmetric SO(16) xSO(16) theory. Even though all three of them were roughly discovered around 
the same time, essentially only the EsxEs theory has been examined in string model building on 
Calabi-Yau manifolds in a more systematic fashion. On smooth compact orbifold resolutions semi- 
realistic models were obtained from the EgxEg theory using line bundles era In the class of favorable 
Complete Intersection Calabi-Yaus (CICYs) with discrete symmetries, specific S(U(1)' 5 ) line bundle 
constructions embedded in a single Es factor have been investigated in |4]jj6]. 

On the other hand, the SO(32) theory has only been studied sporadically regarding phenomeno¬ 
logical applications, although the general framework has been laid out in (T] . The model building 
potential of the non-supersymmetric SO(16) xSO(16) theory compactified on smooth Calabi-Yau ge¬ 
ometries has not really been investigated so far; in [ 8 j[9 this option was at least mentioned. In 
particular, as explained in j9, 10 |, one can avoid tachyons to leading order in a' and g s when working 
in the large volume regime on smooth Calabi-Yaus. 

Systematic model scans for MSSM (Minimal supersymmetric Standard Model)-like models have 
again been performed for the EsxEs theory on string backgrounds that admit exact CFT (Conformal 
Field Theory) descriptions. The most prominent ones have been toroidal orbifold model searches: In 
Refs. [Tl] 141 MSSM-like models have been constructed on the Zg-n orbifold. Related investigations 
have been carried out on the Z 2 x Z 4 orbifold [15], the Z 12-1 
and Z§ orbifolds 


19 


115], the Z 12 - 

an overview can be found in Ref. 


using the free fermionic formulation 
model searches 


orbifold [16 17 , a Z 2 x Z 2 orbifold 118 


20 . In addition, there have been model scans 


21-24 . Moreover, also Gepner constructions have been used for 


Model searches for non-supersymmetric string constructions with an exact CFT description have 
been sporadically considered in the past compared to those in the supersymmetric context. A pioneer¬ 
ing investigation of non-supersymmetric models was carried out in Ref. [27 , which proposes for the 
first time ways to break supersymmetry spontaneously in string theory (see also [28]). After that it was 
shown that it is possible to obtain constructions that interpolate between the supersymmetric heterotic 
EsxEs and then non-supersymmetric SO(16)xSO(16) theories 29 . Conditions for vacuum stability 
in non-supersymmetric closed string models were determined in Ref. 30 : In order to avoid tachyons 


the total number of (massless and massive) bosonic and fermionic degrees of freedom must be equal 
asymptotically. Refs. [31,32 pointed out that the spectrum fulfills certain supertrace relations which 
are very similar to supersymmetric theories, and therefore this was called misaligned supersymmetry. 
The corresponding oscillatory behavior of the (classically stable) non-supersymmetric closed strings 


spectra are related to the non-trivial zeros of the Riemann zeta function 


Non-supersymmetric 


orbifolds were investigated in Refs. [8,34 
and other string contexts can be found e.g. in 
from rational CFTs in Refs. 


Further aspects of non-supersymmetric models in heterotic 
Non-supersymmetric models were constructed 


35-42 


43,44 


Moreover, recently such investigations have been carried out by various groups in the heterotic 
orbifold constructions 191 and free fermionic models 45,46 . In 47 it was found that the difference 


of thresholds for non-abelian gauge still exhibits a universal behavior in some heterotic constructions 
with spontaneously broken supersymmetry. 

Stimulated by this revived interest in non-supersymmetric string constructions one of the main 
purposes of this work is to investigate the phenomenological prospects of the non-supersymmetric 
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SO(16)xSO(16) string on smooth Calabi-Yau spaces. The techniques that are used in this paper have 
been laid out in detail in the paper 10 . To make the presentation self-contained we review some of 


that material in this work. However, here we always have the automated implementations of these 
methods in mind. 

At the same time, the current paper aims to present a comprehensive study of all three heterotic 
theories simultaneously to examine whether such a bias in favor of the supersymmetric Eg x Eg heterotic 
string for smooth compactifications in the past is justified. For that we choose a subset of the often 
considered favorable CICYs that admit freely acting Zjv symmetries and the Schoen manifold. We set 
up our analysis in such a way that we are able to treat the three heterotic string theories side-by-side. 

To this end we consider line bundle backgrounds on these smooth geometries characterized by so- 
called line bundle vectors. Except for the fact that for the three different theories we need to take these 
vectors out of different lattices, their consistency conditions are identical and the resulting spectra can 
be computed employing very similar methods. Using these universal algorithms we can investigate 
the model building potential of the three heterotic theories. Since in the present work we are mainly 
interested in statistical findings, we try to avoid as much as possible putting any phenomenological 
bias in the construction of smooth models. This enables us to examine how the phenomenologically 
interesting models are distributed among the more general landscape of a given geometry. 

To obtain systematic model building results it is almost unavoidable to make use of computer- 
automated model scans. In this paper we lay out in detail how we have set up such model searches 
using procedures that allow for quite general line bundle ansatzes, i.e. parameterizations of many 
embeddings in the whole Cartan sub-algebra of the ten-dimensional gauge group. Nevertheless our 
algorithms are very fast, especially due to the use of index theorems to calculate the chiral spectrum. 
The latter point is very important if one wants to investigate the physical consequences of many 
different line bundles in a statistical manner. 

Since the construction and analysis of such smooth models is technically rather involved, we 
apply various consistency checks on our resulting spectra at the chiral level. In particular, for each 
constructed model we check that all charged massless states have integral multiplicities and that all 
(pure and mixed) gauge and gravitational anomalies cancel in four dimensions. In addition, for the 
big subset of the constructed models that are chiral exact (i.e. give exactly three chiral generations of 
quarks and leptons) we have also calculated the spectra using bundle cohomology. These tests provide 
strong cross checks on both the spectrum and the topological data entering its computation. 

The model searches that we present are mainly focused on obtaining chiral GUT-like models 
which become MSSM-like (for the supersymmetric theories) or SM(Standard Model)-like (for the 
non-supersymmetric theory) upon acting with a freely acting Wilson line. Our side-by-side approach 
for these three heterotic theories means that, aside from many MSSM-like models from the EsxEg 
theory, we obtain a big novel set of MSSM-like constructions from the SO(32) theory. In addition, we 
construct a large collection of SM-like models from the non-supersymmetric SO(16)xSO(16) theory. 


Paper overview 

The remainder of this paper is organized as follows: In Section[2]we briefly review the basic topological 
properties of the smooth Calabi-Yau manifolds we need for our model building analysis focusing on 
favorable CICYs with relatively small hn and the Schoen manifold. In the next section we summarize 
an efficient parameterization of line bundles on smooth spaces. In particular we introduce a convenient 
rewriting of the Bianchi identities and the Donaldson-Uhlenbeck-Yau (DUY) equations. Section [4] 
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explains our model building constructions and analysis in detail. In Section [5] we implemented these 
ideas to perform model searches for GUT- and (MS)SM-like spectra of the three heterotic string 
theories. We compare the outcome of these model scans side-by-side and present statistics on four¬ 
dimensional SM chirality among models with an SU(5) gauge group. In addition, we inspect the 
non-chiral spectrum at the GUT level for those (MS)SM-like models which have exactly three chiral 
generations of quarks and leptons. 


2 Examples of smooth Calabi-Yau manifolds 

In this section we describe the basic topological properties of two types of Calabi-Yau manifolds which 
we have used in our model searches. This data contains the Hodge numbers (/in,/i 2 i) counting the 
number of Kahler and complex structure deformations, and values of the integrated second and third 
Chern classes 

C2i= C 2 , c 3 = / C 3 , (1) 

J Di Jx 

for i = /in. Finally, the intersection number of three divisors Di, Dj and is defined by 


^ijk — / Dj Dk , ( 2 ) 

Jx 

where Di denotes the (l,l)-form Poincare-dual to D^. In particular, the volumes of any curve C, 
divisor D and the manifold X itself can be obtained from 

Vol(C) = f J , Vol (D) = \\ J 2 , Vol(Y) = \ [ J 3 , (3) 

Jc Jd Jx 

using the intersection numbers and the expansion 

J = at Di (4) 

of the Kahler form. Inside the Mori and the Kahler cones the volumes of all curves C and all divisors 
D are positive. 

Generically, the quantity 


Qj(C) 



( 5 ) 


gives the number of intersections of a curve C with divisor Di . If we have an integral basis of curves 
{' Ci } and divisors {Di}, then 


qj [Ci) = f Dj = [ dj = f CiDj = Sij . (6) 

JCi J Di Jx 

This basis is minimal when any effective curve C can be written as a formal sum of the curves Ci with 
non-negative coefficients and a similar expansion holds for any divisor D. In a minimal integral basis 
the Kahler/Mori cone is simply given by a* > 0 for all i = 1,..., h\\ . 
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CICY # 

(/in, h' 2 i ) 

Total volume 

6 Vol(X) — l^ijk C^iC^jC^k 

Chern classes 

(c2i ; C3) 

7862 

( 4 , 68 ) 

12aia2(i3 + 12010204 + 12010304 + 12020304 

( 24 , 24 , 24 , 24 ;- 128 ) 

7491,7522 

( 4 , 44 ) 

12aia2(i3 + 24 aia 2 d 4 + 24040304 + 24020304 

+240104 + 240204 + 240304 + I6O4 

( 24 , 24 , 24 , 64 ; - 80 ) 

7447, 7487 

( 5 , 45 ) 

I201O2O3 + 12040204 + 12040304 + 12020304 + 12040205 

+ 12040305 + 12020305 + 12010405 + 12020405 + 12030405 

( 24 , 24 , 24 , 24 , 24 ;- 80 ) 

6770 

( 5 , 37 ) 

24010203 + 12010204 + 12010304 + 12020304 + 12010205 

+ 12040305 + 12020305 + 12010405 + 12020405 + 12030405 

( 24 , 24 , 24 , 24 , 24 ;- 64 ) 

6715,6788 

6836,6927 

( 5 , 37 ) 

12010203 + 12040204 + 12010304 + 12020304 + 24010205 

+24010305 + 24020305 + 24010405 + 24020405 + 24030405 
+ 24 aia| + 24 a 2 o| + 240305 + 24 a 4 a| + 16 a| 

( 24 , 24 , 24 , 24 , 64 ; - 64 ) 

6732,6802 

6834,6896 

( 5 , 37 ) 

12040203 + 12040204 + 12040304 + 12020304 + 12040205 

+24040305 + 24020305 + 24040405 + 24020405 + 24030405 

+ 120405 + 120205 + 240305 + 240405 + 805 

( 24 , 24 , 24 , 24 , 56 ;- 64 ) 

6225 

( 5 , 33 ) 

12040203 + 18040204 + 18040304 + 12020304 + 60404 

+60204 + 60304 + 18040205 + 18040305 + 12020305 
+24040405 + 24020405 + 24030405 + 12a|a5 + 6aia| 
+6a2a| + 603a! + 120405 

( 24 , 24 , 24 , 36 , 36 ;- 56 ) 

5302 

( 6 , 30 ) 

I204O2O3 + 12040204 + 12040304 + 12020304 + 12040205 

+12040305 + 12020305 + 12040405 + 12020405 + 12030405 

+12040206 + 12040306 + 12020306 + 12040406 + I2O2O4O6 

+ 12030406 + 12040506 + 12020506 + 12030,506 + 12040506 

( 24 , 24 , 24 , 24 , 24 , 24 ; - 48 ) 


Table 1 : This table summarizes the basic topological data for the CICY geometries with 4 < h\\ < 5 
used for model building in this paper. The second column gives the total Calabi-Yau volume from 
which the intersection numbers Kijk can be read off. The third column gives the integrated second 
and third Chern classes. 
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2.1 Complete intersection Calabi-Yaus 


A subset of Calabi-Yau manifolds, which is often considered in phenomenological applications, can be 
described as hypersurfaces (or complete intersections) in products of projective ambient spaces <g> a P fca . 
Such complete intersection Calabi-Yaus (CICYs) are defined by the degrees r aj 4 of the polynomials, 
labeled by A, that characterize the complete intersection under the projective scalings. Since we are 
considering favorable descriptions of CICYs in which all CY divisors are given as pullbacks of the 
projective ambient space hyperplane divisors, we have h\\ projective ambient space factors. In order 
to end up with a CY 3 -fold, we need a complete intersection of codimension ^ k a — 3 . The data 
defining the CICY can thus be summarized succinctly in terms of the h\\ x k a — 3 ) configuration 
matrix T = (T a n) in which each row specifies one factor and each column represents the degrees 
of one polynomial equation under the various projective scalings. 

One of the simplest examples of this is the quintic ( 4 | 5 ) which corresponds to a degree five ho¬ 
mogeneous polynomial in P 1 . Often, the first column which specifies the dimension of the projective 
space is omitted since for a CY it follows unambiguously from the sum of the degrees of the defining 
polynomials. In this case, the quintic has just ( 5 ) as its configuration matrix. 

In this paper we use CICYs from 48 . The data presented there is based on classifications 49-51 


Out of the 7890 CICYs we focus on a subset of the 74 manifolds which are favorable and allow for free 
discrete actions. They all have /in < 6. For the CICYs with /in < 3 we found that it is impossible 
to satisfy the model building conditions (Bianchi identities and DUY equations) while getting to the 
desired number of chiral SM families. Thus, we refrain from explicitly discussing those manifolds. 

In an accompanying paper [l 0 | we have reviewed some useful formulae to obtain the relevant 
topological data, following the techniques described in 52 . A summary of the topological data, 


intersection numbers and second/third Chern classes of the CICYs studied in this paper can be found 
in Table [lj In particular, the intersection numbers Kijk can be read off from the total volume Vol(X) 
of the corresponding CICY. For the favorable CICYs under consideration we always assume that we 
work in a minimal integral basis <©>■ 


2.2 The Schoen manifold 

As an alternative Calabi-Yau example we consider the so-called Schoen manifold. This manifold can 
be thought of as the hypersurface with configuration matrix: 



( 7 ) 


In this description of the Schoen manifold all complex structure moduli are encoded explicitly as 
deformation parameters of the two homogeneous polynomials. However, only three of the total 19 
Kahler moduli are realized explicitly. 

A different realization of the Schoen manifold that describes all Kahler parameters explicitly, is 
given in terms of a resolution of a particular Z2 x Z2 orbifold, the ( 0 - 2 ) orbifold in the Donagi- 
Wendland classification 53 , 54 . This description has the advantage that all /in = 19 divisors are 
described explicitly. These divisors are grouped into three types, denoted by 7 ?i,2 ,3, E r and E r . where 
r = (rir2rs) with r t = 0 , 1 . The divisors Ri are often referred to as inherited divisors as they arise 
from torus divisors on the underlying orbifold. The divisors E r and E r are called exceptional divisors 
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Total volume 

Chern classes 

Schoen 

manifold 

(19,19) 

6Vol(X) = 12ai ja 2 a 3 — E(M 2 } 
+12a 2 |aia 3 - EXM 2 } 

C2(Rl) = c 2 (i? 2 ) = 24, C 2 (i? 3 ) = 0 
c 2 (E r ) = c 2 (E r ) = 0 

c 3 = 0 

Some necessary conditions on the Kahler parameters 

Vol(X) > 0 

E(M 2 < a 2 a 3 

E(M 2 < a i a 3 

Vol(C') > 0 

Z)ri,r 2 ^rir27-3 < a 2 

Er3 ^nr2C 3 < a 3 

Er 3 br\r 2 V 3 + Er 3 ^rir2V3 < a 3 

Eri,r 2 ^rir- 2 r 3 < a l 

Er3 ^rir 2 r 3 < a 3 

^ j 6^ 7 ^ 0 

Vol(.D) > 0 

E ri ,, 2 Vol(K r . ir2r3 )<Vol(i7 3 ) 
EVol(E r ) 2 < 2 Vol(i?i)Vol(i? 3 ) 

E, 1 ,, 2 Vol(E rir2r3 )<Vol( J R 3 ) 
EVol(E r ) 2 < 2 Vol(i? 2 )Vol(i? 3 ) 


Table 2: This table both characterizes the basic topological data of the Schoen manifold and gives a 
number of necessary conditions to be inside the Kahler and Mori cones. 


as they correspond to the blowup cycles that appear in the resolution process. Even though the set 
{Ri, E r , E r } provides a convenient basis for many purposes, it should be stressed that it does not 
constitute a minimal integral basis. The intersection numbers can again be read off from the total 
volume of the Schoen manifold given in Table [2] using the expansion 

J = ^2 a i Ri ~ ^2 b r E r ~ ^2 b r E r (8) 

of its Kahler form. Here all b r , b r and a,; are positive and subject to the Mori cone conditions, some of 
which are listed in Table [2l Additional conditions are mentioned there to ensure that the total volume 
of X is positive. 


3 Heterotic line bundle models 


3.1 Line bundle vectors 

Next, we briefly characterize heterotic models on smooth compactifications of any of the three heterotic 
strings with line bundles. Concretely, we consider the gauge background 


T 

— = Di Hi 

2vr 


Hi = V/ H t 


(9) 


embedded in the Cartan subalgebra of the ten-dimensional gauge group Q. This gauge flux is charac¬ 
terized by a set of bundle vectors Vi = (V/). one for each divisor (l,l)-form Di (see e.g. 11,55 ). For 
the EgxEg and SO(16 )xSO( 16) theories these bundle vectors can be conveniently decomposed into 
two pieces V = (V-, V(') corresponding to the ten-dimensional gauge group factors. 

To ensure integral multiplicities for all massless states the bundle vectors are subject to the flux 
quantization conditions, 


qi(C)Vie A, 


( 10 ) 
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Constraint 

Model-building constraints on 

Condition on bundle vectors 

Vi with i = 1 ,..., h i i 

input data 

Condition on Gram matrices 

Kij = Kji with i,j = 1 ,..., /in 

Flux quantization 

V, even lattice vector 

| -K-ij | — -K-ii ^jj i 

G Z and Ka G 2 Zq" 

Bianchi identities 

^ijk Vj • T4 + 2 C 2 i — 0 

Kijk Kjj j + 2 C 2 i — 0 

Tree-level DUY 

£vol(A)tf = 0 

i 



Table 3 : The table summarizes the main model-building constraints. In the second column the 
conditions are stated in terms of the line bundle vectors, while in the third column we have rewritten 
the conditions in terms of the Ji-matrix, which is used for generating random models. 


for any curve C where qi{C) is defined in ([5]). The possible lattices A are given by 
Ae 8 xE 8 = (R 8 ©S 8 )<g>(R 8 ©S 8 ) , A SO ( 32 ) = Rl 6 ©Sl 6 , A SO ( 16 ) xSO ( 16 ) = (R 8 <g>R 8 ) © (S 8 <g>S 8 ) ( 11 ) 


for the E 8 xE 8 , SO( 32 ) or SO( 16 )xSO( 16 ) heterotic string, respectively. If { D &} defines a minimal 
integral basis of divisors (Jb]), like for the CICY geometries under inspection, the line bundle vectors 
lie automatically inside these lattices. For the Schoen manifold we do not employ a minimal integral 
basis, hence ( 10 ) results in more complicated conditions on the line bundle input data, see ( 3 | for a 
detailed exposition of these conditions. 


3.2 Gram matrix 

It turns out to be worthwhile to define the Gram matrix associated to a set of line bundle vectors as 

Kij = Kji = Vi-Vj = J2 V/V/ , ( 12 ) 

i 

with standard Euclidean inner product. Consequently, we have 

0 < Ka , \Kij \ 2 < Ka Kjj , ( 13 ) 

for i. j = 1 ,..., /in - The first condition is the statement that the vector-norm V 2 is non-negative. 
The second condition is a rewriting of the Schwarz inequality |V) • Vj I 2 < \Vi\ 2 \Vi\ 2 . 

All well-known conditions on a line bundle background can be written in terms of conditions on 
the associated Gram matrix. This is summarized in Table [ 3 ] for the cases where we have an integral 
basis ([ 6 ]). In detail we have: 


Flux quantization 


The flux quantization conditions ( 10 ) imply that the entries of the Gram matrix are restricted such 
that 


K; jqj (C) C Z Vi, 


(14) 












for any curve C, since all three lattices ( TT 1 ) are integral. Furthermore qj(C) E Z, since they are given 
in terms of intersection numbers, cf. ( 15 ). Thus, we always fulfill these conditions by taking integer 
entries for the Gram matrix. In the minimal integral basis ([6]) the entries of K necessarily satisfy 

Kij E Z , Ka E 2 Z>o . ( 15 ) 

The second condition arises because all three possible lattices © are even. 

Bianchi identities 

We assume throughout this work that we do not have any NS 5 -branes. In this case, the Bianchi 
identities for the B-field can be represented as 


l^ijk /,; T 2 C 2 i 0 


in terms of the Gram matrix ( 12 ). 


( 16 ) 


Donaldson-Uhlenbeck-Yau equations 

Finally, the gauge background has to satisfy the DUY equations in order to guarantee that a solution to 
the underlying Hermitian Yang-Mills equations can be found. The tree-level DUY equations without 
additional VEVs read 


Vj Vol(Dj) = (1 , ( 17 ) 

for all I where the divisor volumes Vol(-Dj), defined in ([ 3 ]), will be always taken deep inside the Kahler 
cone to ensure that our geometric description makes sense and that the supergravity approximation 
is valid. Dotting © with Vi for every 7 = 1 ,..., h\\ we obtain a necessary and sufficient condition for 
the DUY equations in terms of the Gram matrix, 


Kij \ol(Dj) = 0 . ( 18 ) 

In this paper we do not consider the one-loop correction to DUY equations. The main reason for 
this is that for the non-supersymmetric SO( 16 )xSO( 16 ) theory it is unclear what the order one-loop 
corrections to DUY are. Since with the present work we aim to compare the three heterotic theories 
side-by-side, we also refrain from including the loop corrections for the supersymmetric heterotic 
theories as well. 

3.3 Equivariant line bundles and Wilson lines 

By construction, CICYs have a trivial fundamental group. It can become non-simply connected by 
modding out some compatible freely acting symmetry (which does not introduce any fixed points). In 
this paper we focus on Zv symmetries, i.e. one-generator Abelian symmetries of order N. 

Generically, such a symmetry consists of introducing phases and permuting homogeneous coor¬ 
dinates inside the ambient P N factors. It could also permute P N factors, but will never mix the 
coordinates among different P N factors. In the ambient space divisor basis {Di}, the first two actions 
do not pose any additional constraint. In contrast, if two P N factors are permuted, the corresponding 
divisors are also interchanged. In the latter case, we have to ensure that our line bundle background, 
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constructed on the initial basis {Di} (in the so-called upstairs picture) is still a valid gauge background 
after the discrete symmetry is modded out (in the so-called downstairs picture). This compatibility 
of the line bundle background with the symmetry action of the underlying geometry is called equiv- 
ariance. 

We restrict ourselves to modding out discrete symmetries of the lowest order possible: For the 
subset we consider in Table [l] these are Z2 actions. In all but one case the equivariance constraint is 
trivial, i.e. no new condition arises for our bundle vectors V). Only for CICY 6225 we have to impose 
V4 = V5 to ensure equivariance of the line bundle background, since the Z2 action maps D4 ■H- H5. 
The freely acting Z2 symmetry admitted by the Schoen manifold also imposes non-trivial conditions 
on the 19 bundle vectors, see Ref. (31 for details. 

4 Automated construction of smooth compactification models 

In this section we describe our procedure to perform model scans for smooth Calabi-Yau compactifica- 
tions with line bundle gauge fluxes of the three heterotic theories. The phenomenological explorations 
of the supersymmetric SO( 32 ) theory and the non-supersymmetric SO( 16 ) xSO( 16 ) string on smooth 
Calabi-Yau geometries have not been considered in the past. The E§xEg string models are rather 
well established, therefore, we use EgxEg model building to cross check our methods and computer 
codes. The final aims of our studies are the following: 

• Generate four-dimensional models on smooth geometries for any of the three heterotic strings 
in an automated way. 

• Search for MSSM-like models for the EgxEg and SO( 32 ) theories. 

• Search for tachyon-free SM-like models for the SO( 16 )xSO( 16 ) theory. 

• Establish estimates of how fruitful and phenomenologically relevant each theory might be on a 
given geometry. 

In order to perform such investigations in an automated fashion we essentially go through three stages: 

1. Generation of the model input data 

Starting from a given heterotic theory and geometry we generate a large collection of compatible 
bundle vectors. 

2 . Computation of the resulting spectrum 

For each of the corresponding models we compute the full charged massless chiral spectrum and 
check whether it is free of anomalies. 

3 . Analysis of the phenomenology 

Based on these spectra we investigate some of their elementary phenomenological properties, 
e.g. to what extend the massless spectra can be related to (MS)SM physics. 

In the following subsections we describe each of these steps in more detail. 
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Calabi-Yau 


Bianchi 

geometry 


identities 


\ 



Gram 


Line bundle 

l 

matrices 


vectors 


Kahler 


DUY 

cone 


equations 




Figure 1 : This diagram displays the steps involved in going from an input geometry to a full specifi¬ 
cation of the upstairs line bundle model. 


4.1 Generation of the model input data 

We focus on line bundle backgrounds on smooth Calabi-Yau manifolds as discussed in Section [ 3 } To 
generate models we make use of the description spelled out there: Instead of directly attempting to 
find a set of line bundle vectors that satisfy the Bianchi identities and then check whether they fulfill 
the tree-level DUY equations, we mostly try to formulate the problem on the level of the Gram matrix 


K defined in ( 12 ). 


For a given K we can construct a set of bundle vectors that lead to this Gram matrix, which then 
fully define an upstairs model, i.e. before modding out any freely acting symmetries. The procedure 
is summarized in the diagram in Figure [lj Even though this diagram indicates the logical order of 
the various construction steps, the use of the Gram matrix means that some steps can be performed 
independently of each other and therefore in parallel. In particular, at the level of K , the consistency 
conditions displayed in Table [ 3 ] are identical for all three heterotic strings. Therefore, finding an 
admissible set of Gram matrices is a theory-independent process. Before we discuss the various steps 
in more detail, we should also stress that Figure [l] indicates the generic steps involved. For some 
geometries certain steps can be optimized or even by-passed. We will discuss the cases where we have 
done so below. 


Generating the Kahler cone 

Since we do not know how to efficiently solve the divisor volume constraints, we simply construct 
the Kahler cone for the manifolds under investigation explicitly. More precisely, we generate a large 
number of points inside the Kahler cone. For the geometries where we have the explicit parametrization 
of the Mori cone in a minimal basis as aj > 0 , we do so by running through a finite integer range for 
the Kahler parameters and normalizing the resulting h\\ volumes Vol(-Dj) by their greatest common 
divisor. 


Generating Gram matrices 

Solving the Bianchi identities and DUY equations is essentially immediate once one has specified the 
geometry and a point in the Kahler cone. Hence, we can turn to the construction of the Gram matrices. 
The Gram matrix has /in(/in + 1)/2 independent entries. Generically, the Bianchi identities ( 16 ) and 
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the DUY equations ( 18 ) fix 2 h\\ entries, such that we have /in (/in — 3)/2 independent components 
in general. 

For many geometries under consideration, the Bianchi identities only depend on a small number 
of linear combinations of the diagonal entries Ka of the Gram matrix. Hence, in these cases it is 
beneficial to solve mainly for these diagonal entries in terms of the off-diagonal entries Kjj, i ^ j, 


using the DUY equations ( 18 ): 


Ka = ~ 


1 

Vol(A) 


E Vo1 (Dj)K i: 


( 19 ) 




Since we have generated the Kahler parameters a* such that we are inside the Kahler cone, dividing 
by any volume Vol(-Dj) never poses a problem. 

In relatively rare cases the Bianchi identities and the DUY equations are partially linear dependent. 
This happens when the combined equations do not have maximal rank 2 /in. In this case the Bianchi 
identities dictate some combinations of the divisor volumes, which could lead to solutions outside of 
the Kahler cone and consequently outside of the validity of our approximation. 

As emphasized in 156 ], there seems not to exist a clear bound for the range of the entries Kij left 
undetermined by the simultaneous solution of the Bianchi identities and the DUY equations. This 
range needs for sure to be at least such that the second Chern class contribution with mostly C2i > 0 
can be compensated. In practice, the possible values of K-matrix elements have to be taken from 
a finite range. To be able to solve the combined system efficiently and fast, we choose this range 
according to the number of free parameters that we have to scan over while inspecting the Bianchi 
identities. The more adjustable entries there are, the smaller the range practically has to become. 


Generating bundle vectors 


For a given Gram matrix AT, we can always construct a collection of line bundle vectors {Vj} on the 


appropriate lattice. For this it is crucial that K obeys the conditions ( 13 ): Most randomly generated 


matrices cannot be written as inner products of vectors as given in ( 12 ). The I\ -matrix construction 


ensures that a set of linearly depended line bundle vectors {Vi} always exists such that the integrated 
Bianchi identities and the tree-level DUY equations are solved inside the Kahler cone. Once we found 
a set of admissible /\-matrices, we make use of essentially two algorithms to determine a generating 
set of line bundle vectors {Vi} on the appropriate lattice: 

The first approach starts from a vector V)° (i = 1 ,..., /in) and tries to find lattice directions which 
minimize / = U; • U) — Kij\ . This is done by adding or subtracting simple roots from V® until 

a preferred directions has been singled out. Since we are working on a lattice, we expect that at some 
point / = 0 . Although this algorithm is extremely fast and efficient for small values of I\ij , it quickly 
becomes rather slow for larger norms (e.g. for Ka > 10). 

For the second approach, we randomly pick vectors of given norm Ka and try to fix the off-diagonal 
scalar products through an adjustable trial-and-error procedure. Concretely, we generate two 
vectors Vj and V2 with V} 2 = K\ \ and V 2 = K'22- Then we check whether V2 ■ V\ = A21. If this is not 
the case, a new U 2 2 = A'22 is generated and re-checked until the matching scalar product V± ■ V2 is found. 
We proceed analogously for the other bundle vectors. A potential deficiency of this algorithm is that 
it might fail to converge: When i vectors have been successfully constructed, it is not guaranteed that 
a vector V)+1 with V 2 +1 = Ki + ij + \ exists that produces the corresponding AT-matrix elements ATj+ij 
with j = Therefore, the algorithm tries to statistically establish (based on time optimization 
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criteria) the maximum number of randomly generated vectors, before the whole ansatz (including 
the i successfully constructed vectors) is abandoned. Once hn — 1 vectors have been successfully 
determined, the final one is computed via the tree-level DUY equations using ( 19 ). 

In practice, we need to generate line bundle vectors for a given it-matrix in a reasonable amount 
of time (i.e. less than ~ Is). Hence, we are obliged to keep the it^’s sufficiently small. Testing 
both approaches, we find that the second procedure is significantly faster in generating admissible line 
bundle backgrounds, allowing for bigger values of it-matrix elements as well as higher h\\ (Kij < 20 
and /in < 6 on favorable CICYs) without significant loss of efficiency. 


4.2 Computation of the four-dimensional spectrum 
Unbroken gauge group 

The unbroken gauge group in four dimensions is determined by the subgroup of the 10 D gauge group 
that commutes with the line bundle background <©■ Denoting the roots of the ten-dimensional gauge 
group by a, the associated unbroken generators in four dimensions are determined by 


a ■ Vi = 0 


Vi = 1,. 


,h 


u 


( 20 ) 


Furthermore, as discussed in Section 3 . 3 , on non-simply-connected CYs we can turn on non-trivial 
gauge backgrounds with zero field strength, i.e. Wilson lines. We focus on freely acting symmetries 
of the underlying geometry. Then, the corresponding Wilson line W has to be of the same order, 
NW 6 A with A defined in ( p~T| ) . In addition to the projection conditions (20) due to gauge fluxes, 
the generators of the surviving group now have to be invariant under the action of the freely acting 
Wilson line W, 


a ■ W = 0 mod 1 . 


( 21 ) 


Charged chiral spectrum 

Subsequently, the charged massless spectrum in four dimensions is computed using the multiplicity 
operator. The charged massless states in ten dimensions are characterized by certain sets R of lattice 
vectors p G A and the ten-dimensional chirality “spin” s for fermions (for bosons we simply take 
s = 1 ). This is in particular important for the non-supersymmetric SO( 16 ) xSO( 16 ) theory which 
contains fermionic states of both chiralities. We take the chirality of the gauginos of the EgxEg 
theory positive and those of the SO( 32 ) theory negative. The relevant weights p (with p 2 = 2 ) of the 
various representations for the three heterotic theories are indicated in Table [ 4 j 
The multiplicity operator evaluated on such states takes the form 


Af = A f(p; s) = ^ K ijk {p ■ Vi)(p ■ Vj)(p • 14 ) + (p ■ V) . 


( 22 ) 


For details on how this formula is obtained, see e.g. 55 . Since it counts the number of chiral states 


we can take those p for which J\T > 0 without loss of generality; states with AT < 0 then corresponds 
to the CPT conjugates of the former. In [ 9,110 ] it is explained why this formula can also be used 
in the non-supersymmetric context to compute the massless spectra of both bosons and fermions 
provided that the geometrical background itself is Calabi-Yau. Finally, to obtain the chiral spectrum 
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Weights of the 10D massless charged heterotic states: ( p 2 = 2) 

State 

N—1, EgXEg 

N=l, SO(32) 

N=0, SO(16) xSO(16) 

Gauge 

(±1,±1,0 6 )(0 8 ), (-i 2fc ,i 8 - 2fc )(o 8 ) 

(±1,±1,0 14 ) 

(±1,±1,0 6 )(0 8 ) 

bosons 

(0 8 )(±i,±i, 0 6),(0 8 )(-i 2fc ,i 8 - 2fc ) 


(0 8 )(±1,±1,0 6 ) 

Pos. chiral 

(±1,±!,0 6 )(0 8 ), (-f/^HO 8 ) 


H 2fc ,i 8 - 2fc )(o 8 ) 

fermions 

(o 8 )(±i,±i,o6),(o 8 )(-i 2 M 8 - 2fc ) 


(o 8 )(-i 2 M 8 - 2fe ) 

Neg. chiral 

fermions 


(±1,±1,0 14 ) 

(±1,0 7 )(±1,0 7 ) 


Table 4 : The weights of the charged massless ten-dimensional states of the three heterotic theories 
are listed. Underlined entries means permutation of the corresponding entries. A power of an entry 
means repetition of this entry and 0 < k < 4 . We have used the convention that the chirality of 
fermions of the supersymmetric EsxEs and SO( 32 ) theories are opposite; the non-supersymmetric 
SO( 16 )xSO( 16 ) theory contains fermions of both chiralities. 


in the downstairs picture after modding out a freely acting Zjy symmetry, we simply need to divide 
all upstairs multiplicities by the symmetry order N. 

In order to ensure that our construction is consistent, the chiral spectrum is under the scrutiny of 
the following consistency checks: 


Integral multiplicities 

The multiplicity operator should always return integral values evaluated on all states. Given that 
factors of 1/6 and 1/12 occur in (22), this provides a non-trivial cross-check on whether the flux 
quantization conditions ( 10 ) have been implemented properly. Moreover, considering a geometry that 
admits a freely acting symmetry and choosing an equivariant line bundle (as described in section 3 . 3 ), 
the multiplicities should still be integral after modding out the freely acting Wilson line. 


Anomaly cancellation 


The second consistency check is the absence of anomalies induced by the chiral fermionic spectrum in 
four dimensions. These anomaly checks involve the generalized Green-Schwarz mechanisms as worked 
out in [101 for all three heterotic theories (inspired by the works of 


57 , 581 ). The information that 


enters the Green-Schwarz anomaly cancellation is schematically indicated in Figure [ 2 ] at the level of 
our input data and the resulting model construction. Consider the situation with an unbroken gauge 
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Figure 2 : This diagram displays the information entering the anomaly checks. 


group with Nq simple non-Abelian gauge group factors and Njj U(l)-factors, 


No _ 

H G * X U(l) Nu , With rk ( G *) + N U = 16 • ( 23 ) 


Since the total rank of the gauge group is sixteen in the heterotic theories, the second relation in 
equation ( 23 ) follows, where rk(Cr x ) denotes the rank of the x-th simple group factor. Anomaly 
cancellation then leads to 


#(checks) = N u + N g + N u N g + N u + Nu(Nu - 1 ) + \ !%(!% - 1 )(N V - 2 ) ( 24 ) 

b 

independent consistency conditions. The different terms count the number of different anomaly con¬ 
ditions 


® G^ 

• grav-grav-U(l) 7 

. G x -G x -XJ(iy 

• U(l) 7 -U(l) 7 -U(l) 7 

• u(i) 7 -u(i) J -u(i) J , iyj 

• U( 1 ) 7 -U( 1 ) J -U( 1 ) X , 

which are all checked in our automated searches. (Since the index I runs over the full Cartan, i.e. from 
1 to 16 , and not over the Abelian factors only, there is some redundancy in the anomaly checks we 
have implemented.) The absence of pure non-Abelian anomalies is checked purely at the level of the 
four-dimensional chiral spectrum, without the need to explicitly provide any additional information on 
the geometry or the heterotic theory under consideration. On the other hand, the precise expressions 
for the pure and mixed Abelian anomalies involve the generalized Green-Schwarz mechanism and are 
therefore theory-depended and require explicit details about the compactification geometry and the 
gauge background. 

The set of conditions for the three heterotic theories, the supersymmetric EgxEg, SO( 32 ) and 
the non-supersymmetric SO( 16 )xSO( 16 ) are explicitly stated in Tables EE and[ 7 j respectively. For 
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the pure non-Abelian anomalies the conditions listed in these tables only correspond to the anomaly 
cancellation in the Cartan parts of the simple gauge group factors. We check, however, the full 
non-Abelian anomaly cancellation in each of these factors, as stated previously. For the EsxEs and 
SO( 16 ) xSO( 16 ) theories we have to make a distinction between the cases where different U(l)s reside 
in the same or different ten-dimensional gauge group factors. As we have stated above throughout 
this paper we assume that no (anti-)NS 5 branes are present, which simplifies the expressions in these 
tables. 

In Tables [ 5 ]j 7 ] the following notation for an irreducible representation R = (Ri,..., Rjv g ) of 
the unbroken gauge group ( 23 ) has been introduced: R r , x = 1 ,...,Nq, denotes an irreducible 
representation of dimension dimR 2 . under the simple non-Abelian group factor G x . The index ^(R x ) 
of the irreducible representation R x is given by 


<(RJ == C(R *> 


( 25 ) 


where C(R X ) is the quadratic Casimir of the algebra Ad^, associated to simple gauge group factor G x 
evaluated on the representation. In particular on the fundamental (vector) representations N and 2N 
of SU(IV) and SO( 2 IV)-groups we have £(N) = 1 and ^(2N) = 2 , respectively. In addition, we define 
the multiplicity 


n T (R) = AA(R) dimR 

y^x 


y > 


( 26 ) 


w.r.t. a non-Abelian factor G x of the irreducible representation R, where AA(R) denotes the value of 
the multiplicity operator ( 22 ) evaluated on any of the weights of R. (For further details see e.g. [59 
and Appendix A of 10 .) To describe the G x x G x X U(l) Nu anomaly we find it convenient to choose 


a particular U(l) basis given by vectors r a , with a = 1 , ... ,Njj and define the charges = t a ■ pr 
where pr is a representative of the weights describing the representation R. On the right-hand-side of 
the anomalies in Tables [6] and [T] we also need the branching of the fundamental of SO( 32 ) or SO( 16 ). 
We denote the resulting fundamental representations of the non-Abelian group factors G x by r x and 
use w G r x to refer to their weights. 


As observed in 10 , the non-supersymmetric SO( 16 ) xSO( 16 ) theory is in some sense in between the 
supersymmetric EsxEs and SO( 32 ) theories. In particular, if one takes the ten-dimensional chiralities 
of these two supersymmetric theories opposite to each other, the addition of their X$ factors appearing 
in the factorized anomaly polynomial 112 = X4W precisely coincides with the factors of the non- 
supersymmetric SO( 16 )xSO( 16 ) theory. Indeed, using Tables H!l it can be readily confirmed that 
for the SO( 16 )xSO( 16 ) theory, the Green-Schwarz-terms on the right hand side of the equality are 
obtained by subtracting the corresponding contributions of the SO( 32 ) theory from those of the EsxEs 
theory. This verifies the consistent dimensional reduction of the various anomaly polynomial factors 
A 8 . 
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Supersymmetric EgxEg theory 

grav — grav — U(l/ 

Ep A/ - P 1 ' = V/'(s ■ E') + |c 2l ) 

G X ' ~ G x , - U(l) a ' 

— 2 G ' [ K ijk(Vj • kfc) + C2 *) 

Gx , - G x , - U(l) a " 

£ R n x KRK(R,')</f - 0 

U(l) 7 ' -U(l) 7 ' -U(l) 7 ' 

EpAf/VV' = vr( Kijk $v!-v{ + vf'v*') + |c 2i ) 

U(l) 7 ' - U( 1 ) J ' -U( 1 ) J ' 

r ± j' 

Ep A WV' = E/' [n ijk {\v^ ■ E' + E/e/) + fa) 

U(l) 7 ' - U( 1 ) J " - U( 1 ) J " 

Ep A/” p 1 ' p J " p J " — 0 

U(l) 7 ' - U( 1 ) J ' - U( 1 ) A ' 

V + J' + K' ± V 

EpA/y' V 3 ' P K ' = KijkV/'vfvf 

U(l) 7 ' - U( 1 ) J " - U( 1 ) A " 
J" ± K" 

J2 P Af p 1 ' p J " p K " = 0 


Table 5 : This table presents the pure and mixed Abelian anomaly cancellation checks on the four¬ 
dimensional charged chiral spectrum obtained from line bundle Calabi-Yau compactifications of the 
supersymmetric EgxEg theory. All sums are only over those weights with M = AT(p;s) > 0 . In 
addition, also the expressions with ' —, interchanging the observable and hidden sectors, are checked. 


Supersymmetric SO(32) theory 

grav — grav — U(l) 7 

-EpA fp 1 = E /(4 KijkVfVi + lcn) 

G x — G x — U(l) a 

-Er^(R)^(Rx)^ - 1 sEit I aKi j kV/V/Vi+ 

+t a ■ Vi(± C 2i + T, W€ V X t( r x) (w ■ Vj)(w ■ 14 )) 

U(l) 7 -U(l) 7 -U(l) 7 

- Ep a/VpV - e 7 (4 KijkV/Vjt + loti) 

U( 1 ) 7 -U( 1 ) J -U( 1 ) J 

I + J 

- Ep A/y P J p J = V/ (k ijk (3E/E 7 + E/E/) + ±c 2i ) 

U( 1 ) 7 -U( 1 ) J -U( 1 ) A 
I+J+K+I 

Ep A7p 7 p J p K — 0 


Table 6: This table presents the pure and mixed Abelian anomaly cancellation checks on the four¬ 
dimensional charged chiral spectrum obtained from line bundle Calabi-Yau compactifications of the 
supersymmetric SO( 32 ) theory. All sums are only over those weights with J\f = A f{p\ s ) > 0 . 
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Non-supersymmetric SO(16)xSO(16) theory 

grav — grav — U(l) 7 

EpA fp 1 ' = V/'(K ijk {3(V!-V^-W/'V/'V^)+2c 2i ) 

Gx , _ Gx , _ u(i)“' 

Er n *'(R) ^(Rx') Qr = -|E I'ta-KijkV/'vfvi' +t a ' .V'(lc 2i + 

+ fcjk Vj ■ E- - E^r,, *0v) G p • V’)(p ■ v$) 

Gx , - G x : - U(l) a " 

Er MR) ^(RxO q a n = -3 E/« CK ijk v/"vfvf - t a " ■ Y/'O c 2i + 

+ E w er x , ^Gv) ( w ■ V j)( w ■ V k )) 

T—1 

P 

1 

T— 1 

P 

1 

1—1 

E P A w - Vi' ( Kijk (lv; ■ VI - 3 vfvi') + c 2i ) 

U(l) r - U(1) J ' - U(1) J ' 

/' ^ J' 

Ep A fp 1 ' P J ' P J ' = Vi' fak^V! ■ VI - \vfvi') + i C2l ) 

t-H 

P 

1 

N 

T— 1 

p 

1 

T— 1 

Ep A/y W" = -Vi' (nijk (\vfvi' + Y/V/") + lc 2 ij 

U(l) r - U(1) J ' - U(1) A " 

V + J' + K' ± V 

EpA yyy' = K ijk vi'v/'v k K ' 

U(1) J ' - U(1) J " - U(l)*" 
J" ± K" 

EpA rp J 'p J "p K " = 0 


Table 7: This table presents the pure and mixed Abelian anomaly cancellation checks on the four¬ 
dimensional charged chiral spectrum obtained from line bundle Calabi-Yau compactifications of the 
non-supersymmetric SO(16 )xSO( 16) theory. All sums are only over those weights with J\f = J\T(p; s) > 
0. (In addition, also the expressions with ' —t", interchanging the observable and hidden sectors, are 
checked.) 

4.3 Analysis of the phenomenology 

Next, we introduce a couple of definitions to characterize smooth models and their 4D spectra. These 
will be used in the subsequent section to present and analyze our model-building results. 

Model classification criteria 

We search for models that are close to the MSSM or the SM depending on whether model building is 
performed in the supersymmetric or non-supersymmetric context. For this purpose, a smooth model 
in 4D (constructed on a CY X ) is solely defined by 

• the full gauge group, i.e. its non-Abelian gauge symmetries and U(l)’s, 

• the chiral massless spectrum (bosons & fermions), 

• its line bundle vectors. 

At the next level, which goes beyond the scope of the present work, further elements of the effective 
field theory analysis in four dimensions, such as admissible D- and F-flat VEV configurations and 
(Yukawa) couplings, could be naturally added to the main definition. 
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In this paper, we have chosen to obtain (MS)SM physics through an SU( 5 ) GUT theory. Therefore, 
we introduce the notion of GUT-like model for a smooth model when 

• there is an unbroken SU( 5 ) GUT group allowed by the line bundle background 

• there are representations available that admit the interpretation of SM quarks and leptons, i.e. 
5 - and 10-plets in the SU( 5 ) case. 

Also the definition of GUT-like models can be naturally extended to SO(IO) or Eg GUTs, which we 
do not consider in the present work. Analogously, we shall call a model (MS)SM-like when 

• the unbroken gauge group contains the SM gauge group Gsm =SU( 3 )xSU( 2 )xU(l)y 

• there is a net number of three chiral generations of quarks and leptons and at least three chiral 
singlets under Gsm to accommodate the right-handed neutrinos 


• all exotic fermions are singlets w.r.t. the SM gauge group. 

Since (MS)SM-like models in the downstairs picture are obtained from an upstairs GUT-like model 
via a freely acting Wilson line which breaks the GUT gauge group non-locally, the standard SU( 5 ) 
normalization of the non-anomalous hypercharge Y is automatically obtained in all the line bundle 
models. 

Finally, we refer to a GUT-like model as chiral exact if the chiral number of 5 - and 10 -plets is 
exactly 3 |T|, before modding out the freely acting symmetry T. In other words, in this case there 
are no additional vector-like 5-5 or 10-10 pairs in the chiral spectrum detected by the multiplicity 
operator ( 22 ). Similarly, we call a spectrum (MS)SM-like chiral exact when the chiral number of 


generations of quarks and leptons is precisely three. 

There are a couple of things to notice about these definitions. First of all, by definition a chiral 
exact GUT- or (MS)SM-like model cannot have any exotic fermionic states charged under SU( 5 ) or 
Gsmi respectively. On the other hand, it could well (and most probably will) have chiral non-Abelian 
representations transforming under a hidden group G , which will be singlets under SU( 5 ) or Gsm • In 
addition, there will be various singlets charged under the various (hidden) U(l)’s. Secondly, starting 
from a specific GUT-like theory in the upstairs picture, there are generically various (MS)SM-like 
theories with different downstairs spectra, corresponding to the various ways in which the hidden 
gauge group can be broken and accordingly branched. In addition, there may be inequivalent ways to 
embed Gsm inside SU( 5 ), leading to distinct downstairs (MS)SM spectra as well. 

Finally, in the present context, the multiplicities of 4 D states are defined from the perspective of 
the GUT or Gsm group. This means that fields that transform in a d-dimensional representation 
of an additional hidden gauge group are counted d times. For example, consider a state 4 ( 3 , 1 ; 3 ) 
where 4 is the multiplicity of this smooth state as computed by the multiplicity operator, ( 3 , 1 ) is 
the representation after branching the GUT group to Gsm (suppressing U(l) charges) and ( 3 ) is 
the dimension of the representation under some hidden surviving SU( 3 ) symmetry. Using the above 
definitions, such a state would correspond to IV = 4 • 3 = 12 times the SM state ( 3 , 1 ) from our 
four-dimensional perspective. 


Inequivalent models 

After we have generated a large collection of GUT-like or (MS)SM-like models on a certain geometry 
we want to get a crude feeling for their phenomenological properties and analyze them statistically. 
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We have to check whether all the models (not only GUT- or MSSM-like) we have generated on a 
given geometry are really distinct. Indeed, having exactly the same model appear over and over 
again in our list might lead to a misguided interpretation of statistical results. In particular, ignoring 
such equivalences in the spectra could have the effect that not all smooth models appear with equal 
probabilities in our statistics. For instance, a bundle background corresponding to a dense K- matrix is 
for sure much more unlikely to be constructed than another bundle with a sparse K -matrix. Moreover, 
since there are various non-linear and iterative steps in our scans, such inequivalence tests are crucial. 

In fact, even before that, we have to recall from Section [ 2 ] that there are geometries with different 
configuration matrices which are nevertheless identical on the level of the geometrical input data we 
make use of, i.e. intersection numbers and second Chern classes. For such groups of manifolds, like 
e.g. CICY 6715 , 6788 , 6836 , 6927 , we have only scanned over and investigated their model building 
potential onceQ 

There are two approaches to define in which cases two models are considered inequivalent on a 
given manifold: 

i. ) Two models are considered inequivalent when they have different chiral spectra up to their U(l) 

charges. 

ii. ) Two models are considered inequivalent when they are described by different line bundles. 

The first approach has often been employed when scanning phenomenological orbifold models. In that 
context, it was avoided to explicitly compare U(l) charges, since in that case one has to consider all 
possible bases for these Abelian gauge symmetries, which is a computationally cumbersome procedure. 

In the second approach we consider line bundle vectors to be distinct when there are no obvious 
symmetries that can map the two line bundle backgrounds onto each other. Hence, one should be 
careful to filter out as many symmetries as possible. There are basically two types of permutation 
symmetries to be considered in this context: Permutation of bundle vector entries Vl ■H- and 
permutation of fluxes V) -H- Vj on divisors Di , Dj which have the same Chern classes and whose 
intersection numbers with each other and all other divisors are also the same. Despite taking such 
permutation symmetries into account, huge lists of inequivalent models are generated already in the 
upstairs picture. That is why in practice, before applying the freely acting symmetry, we have only 
kept lists of inequivalent models containing an SU( 5 ) gauge group, see also the relevant statistics of 
section 15.31 

Albeit seemingly unlikely, it happens quite frequently that two distinct sets of line bundle vectors 
lead to an identical (massless) spectrum. Because of this, the classification approach[Ij) leads to much 
fewer inequivalent models. However, even if the low-energy spectra of two models with distinct line 
bundles are identical, their detailed phenomenology might still be different, since, for example, their 
(Yukawa) couplings are most likely distinct. For this reason we should consider two models to be 
inequivalent when either of the two approaches classify them as being inequivalent. 

In the upstairs picture, approach [nj) implies approach [IJ). So it is enough to keep a list of inequiv¬ 
alent line bundle backgrounds on a given geometry. Concerning the equivalence between downstairs 
spectra, there is a subtlety to be noted. Since the Wilson line projection condition is defined modulo 
lattice vectors, it does not allow for a good classification of distinct models. On the other hand, 
we know from our previous discussion that the same upstairs smooth model generically corresponds 

Ht is well possible that under more detailed phenomenological analyses models on such seemingly equivalent geometries 
might behave differently. 
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Symrnc 

Bianchi identities 

;tric K3-like CICY 7862 

Tree-level DUY 

A'14 = -12 - A'12 - A'13 

vi Ku = 12u 4 + (v 4 - u 2 )A'i2 + (u 4 - u 3 )ATi 3 

A23 = A '14 

v 2 K 22 = 12v 3 + (v 3 - vi)K 12 + (v 3 - v 4 )K 13 

co 

£ 

II 

v 3 1< 33 = 12 v 2 + {v 2 - u 4 )A'i2 + (v 2 - vi)K ± 3 

A34 = Ah 2 

V 4 K m = 12vi + (ui - v 3 )Ki 2 + (m - u 2 ) AT13 


Table 8: This table gives an explicit solution of the Bis in terms of Kij for CICY 7862 (with /in = 4 ) 
from the special class of favorable symmetric K 3 -like CICYs. In the second column the norms of bundle 
vectors are constrained via the DUY equations. There are two free entries (A'12 and A'13) that 
remain unconstrained by both the Bis and the DUY equations. We have abbreviated Vol(-Dj) = V{. 


to various distinct downstairs spectra. It is thus necessary to combine both approaches [IJ) and i) 
to classify models in the downstairs picture according to their bundle realization as well as their 
downstairs massless spectrum. 

4.4 Generating line bundles on specific smooth manifolds 

In this section we describe our model building and classification procedures for a couple of geometries 
(in particular those presented in Section [2]) in a bit more detail. We stress though, that our methods 
are more general and can in principle be applied to any smooth manifold. The purpose of this section 
is to illustrate these methods in concrete examples and prepare the setting for the actual model scans. 
On the other hand, we would also like to emphasize that it is sometimes beneficial to modify or by-pass 
certain steps. In particular, the structure of the Schoen manifold leads to various simplifications in 
the model building process. 


Generating line bundles and classifying models on favorable CICYs 

In this paragraph we focus on a specific type of favorable CICY geometries, i.e. CICYs 7862 , 7447 , 
7487 and 5302 , which contain intersecting K 3 manifolds. Using the intersection numbers of Table [lj 
their Bianchi identities written in terms of the Gram matrix K can be compactly summarized as 


\tijk\Kjk — 24 , 


( 27 ) 


for 1 < i < /in using the standard definition of the totally antisymmetric tensor eijk ■ Consequently, 
these equations do not restrict the diagonal entries Ka = V - 2 of the Gram matrix. Hence, only the 
tree-level DUY equations impose constraints on the norms of the line bundle vectors. 

Let us demonstrate this observation explicitly for such a K 3 -like symmetric CY with /in = 4 , 
CICY 7862 , also called the tetra-quadric (cf. Table [8]): As outlined in Section 4.1 we first generate a 
large collection of points inside the Kahler cone. Next, we use the linear system of equations imposed 
by the Bianchi identities to solve AY 14, K 2 4, K 2 a, A34 in terms of K 12 and A'13, cf. the hrst column of 
Table [8| In the right column, we then incorporate the relations derived from the Bianchi identities into 
the DUY conditions ( 19 ). This results in only two independent Gram matrix entries, A'12, A'13, out of 
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CICY # 

Necessary conditions to be inside the Kahler cone 

7862 

Vol(A) < Vol(-Dj) + Vol(-Dfc) + Vol(A) for i. j. k, l € {1,..., 4} all different 

7491,7522 

Vol(A) + Vol(A) < Vol(A) < 2 £c=i Vol(A) for a, b G {1, 2,3} 

7447, 7487 

Vol(A) < Vol(Dj) + Vol(-Dfc) + Vol(A) for i, j. k, l G {1,..., 5} all different 

6770 

Vol(A) < Vol(A) + Vol(A) + Vol(As) for a / b E {1, 2, 3} 

2Vol(.D a ) < Eti Vol(A,) for a = 4, 5 

6715,6788 

6836,6927 

Vol(A) + Vol(A) < Vol(A) < Ec=i Vol(G c ) 

Vol(A) < Ea/c= 1 Vol (^c) for a, b G {1,..., 4} 

6732,6802 

6834,6896 

Vol(A) + Vol(A) < Vol(A) < Eli Vol(A) 

Vol(A) < EU=i Vo1 ^ c ) for « e {i,... ,4} 

Vol(A) < Vol(A) for b = 3,4 

6225 

Vol(A) + Vol(A) < Vol(T> 4 ) + Vol(A) for a, b € {1, 2, 3} 

Vol( A) + Vol(A) < 3 Ec=i Vol (^ C ) 

Vol(A) < Ed /C =i Vol(A) for d = 4,5 

5302 

Vol(A) < Vol(A) + Vol(A) + Vol(A) for i,j , k, l G {1,..., 6} all different 


Table 9 : This table presents necessary conditions on the divisor volumes Vol(A) in order to be inside 
the Kahler cone for the various geometries of Table [lj These conditions are derived using the definition 
of divisor volumes ([ 3 ]) and the appropriate intersection numbers. For all favorable CICYs presented 
here the volumes of curves dual to the divisors A and of the full Calabi-Yau are positive iff at > 0 . 


the initially = 10 independent components; all other entries are given by the two off-diagonal K^j 
and ratios of divisor volumes Vol(A). 

Subsequently, we need to go through the various points inside the Kahler cone and search for 
particular values of /\ 4 2, /T13, which support non-negative even integer K„ for all i = 1 ,..., h\\. This 
last step is far less trivial than one might initially think: The volumes Vol(A) are not some randomly 
adjustable parameters, but are (at least) related through the inequalities stated in Table [ 9 j For 
instance, on the tetra-quadric no volume can become very big compared to the other three, since it is 
maximally constrained by the sum of the other volumes. 

In total, we see that using the strategy presented in section | 4 . 1 | with some elementary linear 
algebra, we have managed to reduce a highly coupled non-linear Diophantine system of equations 
in the bundle vectors entries V/ to simply scanning over integer values for the two unconstrained 
Gram matrix entries and performing some simple combinatorics. Since all the necessary steps have 
been performed in a theory-independent way, one can use the procedure (described in section | 4 . 1 [ ) to 
generate vectors on the appropriate lattice for any of the heterotic theories. 

To classify inequivalent models in the upstairs picture we employ classification approach [ij) as 
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/in = 4 

Favorable CICYs 

5 

6 

Number of self- 

intersecting divisors 

7862 

7447, 7487 

6770 

5302 

0 

7491, 7522 

6715, 6788, 6836, 6927 
6732, 6802, 6834, 6896 


1 


6225 


2 


Table 10: The favorable CICYs considered in our scans, are grouped according to the number of self- 
intersecting divisors they possess. This classification is helpful to filter out permutation symmetries 
among equivalent bundle vectors supported on these divisors. 


discussed in Section |4.3| It is crucial to filter out various permutation symmetries in the bundle data 
used to classify model equivalence: To systematically filter out permutation symmetries among vector 
entries Vl ■H- Vu we have implemented a geometry-independent routine. In contrast, one should be 
more careful when considering the second type of permutation symmetry, i.e. interchanging fluxes 
Vi o Vj on divisors whose intersection numbers and Chern classes are the same. Since the statement 
is obviously geometry-depended, one needs to perform a pre-analysis for each geometry in order to 
see for which divisors this is the case. For instance, we immediately see that in our specific example, 
CICY 7862, all divisors are K3-surfaces, which yields 4! = 24 different possibilities to rearrange the 
exact same bundle background by permuting all four Vi . For a faithful phenomenological analysis 
only one of those equivalent backgrounds has to be stored. 

Fortunately, for the limited set of favorable CICYs we consider in this paper, there is a clear 
pattern for the various divisors Vol(-Dj) and their triple intersection numbers ■ Concretely, we 
find that for most of these geometries, many (if not all) divisors do not have any self-intersections (i.e. 
Df = 0) at all. For this subset of divisors the geometry looks essentially identical to the K3-like CICYs 
we have just investigated. For the self-intersecting divisors (i.e. Df 7 ^ 0) the situation deviates from 
our previous discussion. Usually there are one or two self-intersecting divisors (which tend to have 
the same Chern classes and intersection numbers among themselves). This motivates the following 
strategy: We group the CICYs we want to scan according to the number of self-intersecting divisors 
they have, as done in Table 10 For a given group of manifolds the properties under the exchange of 


fluxes on divisors with the same C 2 * and Kijk are mostly identical or at least very similar. 

Generating line bundles on the Schoen manifold 


The Schoen manifold was introduced in Section 2.2 In order to generate chirality, we switch on 


magnetic fluxes, encoded by line bundle vectors, B\, B 2 , B^, on the inherited divisors R\, R 2 , R 3 , 
respectively. On the exceptional cycles E r and E r we use fluxes V r , V r respectively. The multi-index 
r was defined in Section 2.2 To satisfy the flux quantization, we require that |3| 


Bi , B 2 , #3 <E A , 2 Vr , 2 Vr G A . 


(28) 


with A given in (11). Using the intersection numbers from Table [ 2 ] the anomaly cancellation re¬ 
quirement (without the presence of non-perturbative brane effects) in four dimensions amounts to 
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J2( y r) 2 = 12 + 2B 2 B 3 , J](U r ) 2 = 12 + 2B 1 B 3 , 


(29a) 


r r 

B\ ■ V r = 0 , B 2 ■ V r = 0 , Bi • B 2 = 0 . (29b) 

The DUY conditions are more involved compared to the CICY case, as can be seen from Table [2j 
Roughly, they can be characterized as Vol(Rj) > 4Vol(B r ) > 0 (and similarly for the divisors E r ). 

To simplify the analysis we aim to satisfy the tree-level DUY equations for the inherited divisors 
independent of the exceptional ones. Thus the magnetic fluxes are taken to satisfy 


B x + Bo + B 3 = 0 . (30) 

As a convenient ansatz we further assume that sets of two bundle vectors V r (analogously for V r ) are 
always equal or opposite, in such a way that the equivariant identifications are respected and the DUY 
equations © are satisfied simultaneously: 

Vr ir2 r 3 = (-l) r2+r3 V ri00 . (31) 

In particular, by choosing to satisfy the tree-level DUY separately for the inherited and exceptional 
divisors, it is guaranteed that Kahler parameters cii,b r ,b r exist such that all relevant volumes are 
positive. 

Note that in addition, the whole construction (i.e. consistency conditions and spectrum) is invariant 
under exchange of fluxes on divisors with permutation symmetries as explained above, e.g. B\ «->■ B 2 
together with V r <->■ V r . As with the CICYs discussed in the previous paragraph, one also needs to 
take such permutation symmetries into account in order to avoid an enormous over-counting. 

Compared to the favorable CICYs reviewed in the previous section, we find that the Schoen 
manifold has some convenient features in the non-integral basis defined in Table [2j in contrast to the 
favorable CICYs, where all scalar products among different vectors appear in Bianchi identities, most 
of the off-diagonal entries of the Gram matrix for this manifold remain fully unconstrained. Especially 
computationally, this has important consequences in random bundle generation, since we only need 
to generate a handful of U-’s of a given norm, which is significantly faster than any known algorithm 
for generating vectors of fixed inner products. Furthermore, the simple ansatz for the DUY equations 
resolves the seemingly complicated Mori/Kahler cone conditions into very simple inequalities for the 
divisor volumes. For the favorable CICYs, in spite of having parameterized solutions to the Bianchi 
and the DUY conditions, we were not able to find in this set-up an analytic expression to be inside 
the Kahler cone; instead we have to go through the combinatorial exercise of the previous section. 

4.5 Computer implementation 

The computer code that automates the various procedures described above will be made public in 
the future under the name Compactif ier. This code can be thought of as a major extension to the 
□rbif older package j60], which enables the latter to also address smooth line bundle compactifica- 
tions. It uses a lot of the group theoretical routines of the original Orbif older code to perform model 
analyses, but all the routines to generate consistent line bundle model input data and to construct 
the chiral spectra have been written from scratch. Moreover, the anomaly checks of the Orbif older 
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have been extended to include the non-universal Green-Schwarz mechanism on smooth Calabi-Yaus 
with a collection of axions for all three heterotic theories. 

Since the chiral spectra are computed using index theorems, a smooth model construction is in 
practice independent of h\\ and can be implemented very efficiently. On the other hand, the group 
theoretical routines of the original Orbif older have been optimized for fast model building analysis 
independent of the particular string setting under consideration. This means that the construction of 
a smooth model together with the basic consistency checks performed takes in average 0.005 seconds. 
In our specific setting, where we choose to obtain the (MS)SM spectrum via an intermediate SU(5) 
GUT-like theory, it is actually of great computational benefit to first determine the unbroken gauge 
group and proceed with the computation of the chiral spectrum only if the smooth model under 
construction contains an SU(5) group factor. 

In order to ensure that this code is not specific to one particular type of smooth Calabi-Yau space, 
we have detached the definition of the geometry from the actual code. The information is stored in a 
so-called geometry file that contains basic topological data of a Calabi-Yau geometry as described in 
Section [2j like its intersection numbers and the values of the the integrated second Chern classes 
C 2 i- Line bundle data can either be entered by hand on the level of a bundle file that specifies the line 
bundle vectors, or can be randomly generated using the procedures described in this section. 

The results of such model searches using this code are presented in the next section for all three 
heterotic string theories. 


5 Model searches 

We now present our results for heterotic model-building on a selection of favorable CICYs and the 
Schoen manifold. For all scans on CICYs we discuss in this section, the Standard Model is obtained 
from an intermediate SU(5) GUT by modding out an order two freely acting symmetry of the un¬ 
derlying geometry. Scan routines were executed simultaneously for four core-weeks to construct more 
than 10 9 smooth models from each theory on every different geometry. 

Favorable CICYs with h\\ = 1 have not been considered here, since the tree-level DUY equations 
cannot be satisfied inside the Kahler cone. For CICYs with small h\\ = 2, 3, it turns out that the 


conditions (13) together with the Bianchi identities and the tree-level DUY equations are often over¬ 
constraining: At least inside the parameter range |AY| < 40 we have scanned over, we found almost 


no solutions to the Bianchi identities, for most of the favorable CICYs with Hodge number h\\ < 4. 
For this reason we do not consider such very small h\\ CICY geometries in the following. 


5.1 Statistics of model searches 


Tables 11 12 and |13| present our results from model scans over the same set of favorable CICYs with 


4 < hn < 6 and the Schoen manifold of the two supersymmetric EsxEs and SO(32) heterotic theories 
and the non-supersynmretric SO(16)xSO(16) theory, respectively. The left column always lists the 
geometry we have considered. The number refers to the CICY label in the classification list 49,50 


Following the definitions of Section 4.3, the two sets of columns on the right give the total number 
of inequivalent models we have constructed on each geometry. Here we distinguish between upstairs 
and downstairs models, i.e. between SU(5) GUT-like models and the resulting (MS)SM-like models 
obtained after modding out the compatible freely acting symmetry. In particular, the fourth and sixth 
column gives the number of GUT-like and (MS)SM-like models respectively, which are chiral exact. 
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As expected from our remarks in Section 4.3 the number of downstairs models is indeed (much) 


larger than the number of upstairs models. One the one hand, there are various possible group theoret¬ 
ical breakings for the hidden sector gauge group. This fact results in multiple models in the downstairs 
spectrum, which correspond to the same SU(5) GUT. In addition, it is often possible to embed the SM 
gauge group in more than one way inside the observable sector gauge group, by appropriate choices 
of a hypercharge generator Y. such that the desired GUT hypercharge normalization is preserved. 


Geometry-based analysis 

The results of these tables can be interpreted both from the perspective of different geometries or of 
the different theories. We begin with discussing some geometric features: 

The symmetric K3-like CICYs are most fruitful among the set of favorable CICYs we have con¬ 
sidered in these scans. As outlined in section l4~4l this is due to the absence of a norm constraint on 
Vf (i.e. on Ka for all i ) coming from the Bis. Since this statement is solely based on geometric data, 
it is independent of the particular theory under consideration. 

One might have expected that among similar geometries (e.g. symmetric K3-like CICY spaces, i.e. 


the first row of Table 10 without self-intersections), the number of models would grow with increasing 
/in, as the number of bundle vectors increases. Seemingly from the results under inspection this 
expectation is not verified. A possible reason is that we have scanned over the same duration for all 
geometries independently of /in, even though, as remarked in Section[4j generating line bundle vectors 
for a given if-matrix scales polynomially in time with larger h\\. A rough estimate for instance, shows 
that one would need to scan three times longer on the CICY 5302 to produce as many models as on 
CICY 7862. 

Finally, we see that the Schoen manifold is a geometry which is particularly fruitful for the purpose 
of model building. As explained above, the main reason for this is that the Schoen manifold essentially 
only restricts norms of bundle vectors but leaves a lot of freedom otherwise. 


Theory-based analysis 

As far as we know the analysis presented here is novel in the sense that all three heterotic string theories 
are compared side-by-side for the first time when compactified on the same smooth geometries. We find 
that in our scans the Eg x Eg theory is the most fruitful for nearly every geometry (with the exception 
of the Schoen manifold, where SO(32) produces more models) we have scanned. The number of 
interesting GUT-like and SM-like models obtained within the SO(16)xSO(16) theory is comparable 
to the number of GUT- and MSSM-like models in the supersymmetric context. 

Given the revived interest in the non-supersymmetric SO(16)xSO(16) heterotic theory, let us 
concentrate on it in a bit more detail to understand the outcome more closely. In a previous version 
of this paper, we had restricted the line bundle vectors to lie on the lattice RsffiR.8, overlooking 
the fact that bundle vectors in SgffiSg are also compatible with flux quantization. This restriction 
had vetoed for the non-supersymmetric case many bundle vector solutions that are admissible in the 
supersymmetric theories. Consequently, there were thousand times less inequivalent models than in 
the Eg x Eg case on each geometry. Lifting this obsolete restriction we find that the number of semi- 
realistic SM-like models is very similar to their supersymmetric counterparts. A similar statistical 


outcome holds for the number of models with exotics. In particular, as can be seen from Table 13 


we 


are able to find some SU(5) GUT-like (and consequently SM-like) models which are chirally exact for 
either their fermionic or bosonic spectra and sometimes even for both. 
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Inequivalent SU(5) models for the Eg X Eg theory on smooth CYs 


h\ 1 

Geometry 

(Name / CICY #) 

Upstaii 

GUT-like 

rs picture 

Chiral exact 

Downstai 

MSSM-like 

irs picture 

Chiral exact 

4 

Tetra-quadric (7862) 

245,387 

39,375 

1,571,972 

328,445 

4 

7491, 7522 

2,099 

56 

17,928 

764 

5 

7447, 7487 

177,359 

33,046 

1,106,276 

352,458 

5 

6770 

110,823 

8,286 

816,098 

86,975 

5 

6715, 6788, 6836, 6927 

3,011 

239 

17,704 

1,218 

5 

6732, 6802, 6834, 6896 

27,898 

823 

203,210 

11,443 

5 

6225 

2,016 

0 

26,674 

0 

6 

5302 

154,352 

13,905 

807,234 

100,941 

19 

Schoen 

355,745 

186,630 

2,631,198 

1,682,159 


Table 11: This table provides results of our searches for MSSM-like models within the Es x Es heterotic 
theory on selected smooth Calabi-Yau manifolds. The inequivalent SU(5) models with 6 generation 
GUTs become MSSM-like after modding out a freely acting Z 2 Wilson line. 


Inequivalent SU(5) models for the SO(32) theory on smooth CYs 


h\ 1 

Geometry 

(Name / CICY #) 

Upstaii 

GUT-like 

rs picture 

Chiral exact 

Downstai 

MSSM-like 

irs picture 

Chiral exact 

4 

Tetra-quadric (7862) 

159,510 

841 

1,128,286 

5,760 

4 

7491, 7522 

351 

1 

2,956 

8 

5 

7447, 7487 

69,669 

3,385 

561,911 

34,076 

5 

6770 

53,712 

74 

410,830 

547 

5 

6715, 6788, 6836, 6927 

407 

0 

2,204 

0 

5 

6732, 6802, 6834, 6896 

6,017 

2,655 

47,251 

20,227 

5 

6225 

0 

0 

0 

0 

6 

5302 

2,598 

42 

15,326 

232 

19 

Schoen 

493,114 

207,644 

4,029,615 

1,939,579 


Table 12: This table provides results of our searches for MSSM-like models within the SO(32) heterotic 
theory on selected smooth Calabi-Yau manifolds. The inequivalent SU(5) models with 6 generation 
GUTs become MSSM-like after modding out a freely acting Z 2 Wilson line. 
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Inequivalent SU(5) models for the SO(16) X SO(16) 

theory on smooth CYs 

hn 

Geometry 

(Name / CICY #) 

u F 

GUT-like 

•stairs 

Cl 

Fermi 

(icture 

liral exa( 

Scalar 

it 

Both 

Do\ 

SM-like 

vnstairs 

Cl 

Fermi 

picture 

liral exac 

Scalar 

t 

Both 

4 

Tetra-quartic (7862) 

209,743 

281 

263 

1 

1,575,098 

2,370 

2,000 

15 

4 

7491, 7522 

1,873 

0 

1 

0 

14,651 

0 

11 

0 

5 

7447, 7487 

28,209 

901 

46 

5 

149,143 

5,154 

377 

52 

5 

6770 

65,888 

173 

85 

0 

437,327 

914 

707 

0 

5 

6715, 6788, 6836, 6927 

120 

7 

0 

0 

518 

89 

0 

0 

5 

6732, 6802, 6834, 6896 

460 

33 

0 

0 

3,119 

275 

0 

0 

5 

6225 

72 

0 

0 

0 

483 

0 

0 

0 

6 

5302 

355 

22 

0 

0 

1093 

66 

0 

0 

19 

Schoen 

456,594 

5,169 

2,745 

30 

3,002,353 

37,276 

21,955 

237 


Table 13: This table provides results of our searches for SM-like models within the SO(16) x SO(16) 
heterotic theory on selected smooth Calabi-Yau manifolds. The inequivalent SU(5) models with 6 
generation GUTs become SM-like after modding out a freely acting Z 2 Wilson line. In upstairs and 
downstairs picture it is indicated how many models have no fermionic or scalar chiral exotics or 
whether it is fully exophobic at the chiral level (i.e. net number three of chiral SM families and no 
Higgs triplet, doublets). 


Counting the number of states: Superfields versus bosons and fermions 

Concerning the number of chiral states, let us stress that the notion of exotics is very different de¬ 
pending on whether it is used in the supersymmetric or non-supersymmetric context. In order to 
describe the four-dimensional spectrum of one of the supersymmetric strings, we follow the conven¬ 
tion to use superfields which include both bosonic and fermionic states. For compactifications of the 
SO(16)xSO(16) theory, we adopt the convention of |9| to display bosons and fermions separately 
since they are generically in very different representations. Hence, when comparing a supersymmetric 
spectrum to a non-supersymmetric one, one has to realize that even though the number of exotic 
superfields is smaller than the number of exotic fields in the non-supersymmetric theory, this might 
correspond to more exotics in the supersymmetric theories when bosons and fermions are considered 
separately. In particular, the scalar partners of the MSSM matter multiplets are considered as scalar 
exotics in the non-supersymmmetric context. Indeed, given the fact that we start with a smaller 
representation for the scalar fields in the ten-dimensional SO(16)xSO(16) theory as compared to the 
supersymmetric theories, it is to be expected that there are less scalar exotics at the GUT- or SM-level. 

Total number of models and infinite sets 

We close the analysis of the chiral data with a remark on the total number of models we have found. 
First of all, we stress that these scans were not aimed to be exhaustive by any means. The duration 
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Geometry 

(Name / CICY #) 

Heterotic 

theory 

Higgs candidate 

#(5-5) > 1 

Vector-like exotics 

(extra 5-5, 10-10 pairs) 

Tetra-quadric (7862) 

Eg X Eg 

100% 

0% 

SO(32) 

95.7% 

0% 

7491, 7522 

EgxEg 

100% 

0% 

SO(32) 

100% 

0% 


EgxEg 

92.4% 

7.0% 

7447, 7487 

SO(32) 

98.2% 

1.7% 

6770 

EgxEg 

41.9% 

58.0% 

SO(32) 

100% 

0% 

6715, 6788, 6836, 6927 

EgxEg 

100% 

0% 

6732, 6802, 6834, 6896 

EgxEg 

58.4% 

40.9% 

SO(32) 

83.2% 

16.6% 

5302 

EgxEg 

92.1% 

3.3% 

SO(32) 

69.0% 

0% 


Table 14: In this table we indicate the percentage of chiral exact SU(5) GUT-like models obtained 
from the supersymmetric heterotic string theories with additional non-chiral states which are charged 
under the SU(5). The Higgs multiplets can arise from vector-like 5-5 pairs. The other non-chiral 
states constitute vector-like exotics, which can also come from additional 10-10 pairs. 


of four weeks was mostly dictated by our requirement to obtain representative statistical results and 
not by some upper bound on the total number of inequivalent GUT-like/MSSM-like models we were 
able to spot. Furthermore, as explained in 56 for smooth compactifications with line bundles there 


seems to be no clear theoretical bound on the total number of models supported on a given manifold 
inside the validity of the supergravity approximation. In that sense, we think that the very notion 
of exhaustive scans is not well-defined on those smooth manifolds and we have thus concentrated 
primarily on generic statistical findings. 


5.2 Analysis beyond the chiral spectrum 

In this subsection we investigate smooth Calabi-Yau compactifications with line bundles beyond the 
chiral spectrum. This analysis is important since vector-like 5-plets can provide potential Higgs 
candidates while solving the doublet-triplet splitting problem, see e.g. 10 for a discussion of this in 
the current context. For our purposes it suffices to restrict our analysis here to the upstairs GUT 
vector-like spectrum. 

Since the multiplicity operator p2|) is insensitive to fully vector-like states, we calculate their 


spectrum via equivariant line bundle cohomology]^] For this we use the automated computer tool 
Starting from those models that are chiral exact, we compute the full spectrum to 


cohomcalg (61 62 


see whether they have at least one non-chiral 5-5 pair. (In a few cases the exact number of non-chiral 


2 Computation of the full spectrum is unfeasible for the Schoen manifold due to its large Stanley-Reissner ideal. Hence 
we omit it from our discussion here. 
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pairs is not fixed uniquely and depends on further input data.) From such a pair we could obtain 
Higgs doublets without their triplet partners via a suitable choice of a Wilson line. An additional 
source for vector-like exotics could be pairs of 10-10-plets. They will also split under the action of the 
freely acting Wilson line and thus vector-like pairs of exotics can remain in the downstairs spectrum. 

In Table 14 we collect the results of the analysis of the full spectrum for our smooth chiral exact 
models discussed in the previous section. When the chiral scans of the previous subsection did not 
find any chiral exact models on a given geometry, we ignore this geometry in the current analysis; 
for this reason e.g. CICY 6225 is absent in Table [T4j In the second-to-last column we indicate the 
percentage of those models which have at least one 5-5 Higgs pair candidate and no (10-10) exotics. 
These percentages are defined w.r.t. the total number of chiral exact EsxEs and SO(32) models given 
in Tables 0 and p~2| In the last column we list models which, apart from having at least one Higgs 
pair candidate, possess additional vector-like exotic states. These exotics can be additional 5-5 and 
10-10 pairs. If the numbers do not sum up to 100%, the rest of the models do not have a 5-5 pair 
and hence no Higgs candidate. For these we do not distinguish whether or not there are additional 
exotic 10-10 pairs present. 

We observe that most of the chiral exact MSSM models of the supersymmetric theories do have 
vector-like 5-5 pairs which may serve as Higgs candidates in the downstairs spectrum. We also find 
that the number of vector-like exotics (beyond multiple Higgses) is quite small or even negligible in 
almost all cases. Moreover, such vector-like exotics are more abundant in the EsxEs theory than in 
the SO(32) theory. 

We have not included any GUT-like models from the non-supersymmetric SO(16)xSO(16) theory 
in Table 14 since for non-supersymmetric models the definition of chiral exactness differs from that of 
the supersymmetric ones in that chiral scalars are distinguished from fermionic exotics. Nevertheless, 
we have analyzed the spectra of the six models that are both fermionic and bosonic chirally exact on 
the two CICY geometries 7487 and 7862 and found that each of these models contains three scalar 
five-plet vector-like pairs, i.e. Higgs boson candidates, and no additional vector-like Standard Model 
matter. 


5.3 Distribution of the number of chiral generations 

In this subsection we study the likelihood that a given GUT model leads to three generations in the 
downstairs picture using the geometries and the line bundle construction outlined in the previous 
sections. To answer this question statistically, we have plotted the percentage of SU(5) GUT-like 
models with N GUT generations against the total number of models with SU(5) gauge group. For 
this discussion all figures refer to inequivalent models as defined in Section |4~3l In the two histograms of 
Figure[3]we present our statistical analysis for the Schoen manifold and the tetra-quadric, respectively, 
for all three heterotic theories side-by-side. Let us emphasize a few interesting features of these plots: 

First, we see that on both geometries the majority (in most cases more than 70%) of SU(5) GUT 
models are non-chiral, i.e. they have zero net number of GUT generations. The peaks for the number 
of generations lies at either 2 or 4, depending on the geometry and theory under consideration. The 
distribution dies off rather slowly especially for the EsxEs on the tetra-quadric. In other words we see 
that the number of GUT generations is scattered over a much larger range for the favorable CICYs 
with small h\\ than on the Schoen manifold, which has a clear preference for a small number of GUT- 
generations (up to ten at most). For these reasons we have plotted these distributions in Figure [3] 
with two different scales to visualize both the peak of non-chiral models and how the net chiral GUT 
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Figure 3: The two plots show the portion of chiral SU(5)-GUT models with a given generation 
number (in the upstairs picture) for each of the three heterotic theories on the Schoen manifold and 
the tetra-quadric, normalized w.r.t. the total number of SU(5) models we have constructed (well over 
10 7 inequivalent SU(5) models for each theory on the Schoen manifold and well over 10 6 for each 
theory on the tetra-quadric). 
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Table 15: A chiral exact MSSM-like model from the supersymmetric SO(32) theory on the tetra- 
quadric. The left part of the table gives the upstairs GUT-like spectrum and the right part the 
resulting downstairs spectrum after Wilson line symmetry breaking. For clarity we only display the 
hypercharge and omit the other U(l) charges. 


models are distributed. 

For both geometries we mod out an order 2 Wilson line, hence six net GUT generations would be 
phenomenologically preferred. However, we see that this only happens in at most 2% of the constructed 
SU(5) models on the Schoen manifold and at most 6% of the cases on the tetra-quadric. Hence, these 
statistics do not seem to indicate that (MS)SMs with three generations are singled out or favored 
in any way. Instead, we have to veto thousands of phenomenologically uninteresting models, before 
actually obtaining any relevant GUT theory. 


6 Examples of (MS)SM-like models 


In this section we present explicit examples of an MSSM-like model from the SO(32) theory and an 
SM-like model from the SO(16)xSO(16) theory, both on the tetra-quadric (CICY 7862) geometry. 
These examples are chiral exact models, the first in the supersymmetric sense defined above and the 
latter at the level of fermions. Since for the EsxEs theory an example of a chiral exact MSSM- 


like model was presented in 56 with an extended analysis of its EFT, we refer to that paper for a 


comprehensive presentation for a chiral exact example from the EsxEs theory. 
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A chiral exact MSSM-like heterotic SO(32) model 


In Table 15 we consider a specific example for an SO(32) smooth model. We present the defining line 
bundle vectors and the Z 2 Wilson line, as well as the resulting upstairs and downstairs spectra for the 
hidden and the observable sector. The rows are organized such that the projection and branching due 
to the Wilson line can be reconstructed for the various upstairs to downstairs representations. One 
may verify that all gauge anomalies are cancelled. 

Note that the number of 3- and 2-plets in the downstairs spectrum arise from two types of states in 
the upstairs picture: (5; 1,1,1,1) and (5; 1, 2,1,1) both of which have multiplicity 2. The downstairs 
multiplicity is obtained as: (2 + 2 • 2)/2 = 3. The additional SU(2) under which the (5; 1, 2,1,1) are 
charged may therefore be a way to distinguish the first two lighter generations from the third. 


A chiral exact SM-like heterotic SO(16) X SO(16) model 

Next, we present one of the SO(16)xSO(16) fermionic chirally exact SM-like models. The model is 
summarized in Table 16 Since the model is non-supersymmetric we list the complex bosons and 


chiral fermions separately on gray and white backgrounds, respectively. It might seem that this model 
contains many more states than the supersymmetric examples. However, one should keep in mind 
that we give the spectra of bosons and fermions separately for the non-supersymmetric GUT models. 
In contrast, for the supersymmetric models we follow the standard convention to give the spectra in 
terms of superfields. If one were to write out the full bosonic and fermionic spectra of the superfields, 
one would find that the non-supersymmetric models contain less scalars than their supersymmetric 
partners. In fact, this is to be expected, since the supersymmetric theories in ten dimensions contain 
496 gauge fields and gauginos, while the non-supersymmetric SO(16)xSO(16) has only 240 gauge 
fields and 512 charged fermions. 

It is amusing to note that exactly the same bundle and Wilson line provide us with an MSSM-like 
model for the EsxEs theory; however this MSSM-like model is not chiral exact. The feature that the 
same bundle leads to both MSSM- and SM-like models is not actually that rare and demonstrates 
again how closely related the theories are, at least to leading order at the level of massless spectrum. 


33 




SO(16) X SO(16) SM on CICY 7862 with line bundle vectors and 7L 2 Wilson line: 
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SU(5)'xSU(4)"xSU(3)" 

SU(3)'xSU(2)'xSU(3)"xSU(2)"xU(l)^ 

observable 

24 (5; 1,1) 

20 (5; 1,1) 

12 (3,1; 1,1) 1 +12 (1,2; l,l)i 

3 2 

10(3,1; 1, l)i +10 (1,2; 1,1) 1 

3 _ 2 

6 (10; 1,1) 

6 (5; 1) 

3 (3, 2; 1,l)_i + 3 (3, 1;1, 1)2 + 3 (1,1; 1, l)_i 

3 (3,1; 1,1) 1 +3(l,2;l,l)i 

3 2 

hidden 

2(1; 6, 3) 

12(1; 1,3) 

8(1; 1,3) 

56(1; 1,1) 

(1,1; 3, 2) 0 + (1,1; 3, 2) 0 + (1,1; 3, 1) 0 + (1,1; 3, 1) 0 

6 (1,1; l,2)o+ 6 (1,1; l,l)o 

4 (1,1; 1, 2) 0 + 4 (1,1; 1,1) 0 

28(1,1; l,l)o 

2(1; 4, 3) 

12(1; 6,1) 

14(1; 4,1) 

8(1; 4,1) 

8(1; 1,3) 

134(1; 1,1) 

(1,1; 3,2) 0 + (1,1; 1, 2) 0 + (1,1; 3,1) 0 + (1,1; 1,1) 0 

6 (1,1; 3,1) 0 + 6 (1,1; 3,1) 0 

7 (1,1; 3,1) 0 + 7 (1,1; 1, l)o 

4(1,1;3,1) 0 + 4(1,1; l,l)o 

4 (1,1; l,2)o+ 4 (1,1; l,l)o 

67(1,1; l,l)o 


Table 16: A chiral exact SM-like model from the SO(16)xSO(16) theory on the tetra-quadric. In 
this table we use the same conventions as in Table 15, except that we depict the complex bosons and 
chiral fermions on gray and white backgrounds, respectively. 
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7 Conclusions 


We have performed model searches on smooth Calabi-Yau compactifications for both supersymmetric 
heterotic EgxEg and SO(32) theories and for the non-supersymmetric SO(16)xSO(16) theory simul¬ 
taneously. As far as we are aware this is the first work in which all these three theories are compared 
in this side-by-side fashion. 

Considering smooth Calabi-Yau compactifications of all three heterotic theories is particularly 
beneficial for the non-supersymmetric SO(16)xSO(16) theory, since it is guaranteed to avoid tachyons 
at leading order in o! and g s . Furthermore, we can make use of many known methods to analyze 
the phenomenological properties of such constructions. We focused on a subset of favorable CICYs 
of relatively small h\\ < 6. In addition, we considered the Schoen manifold, described as an orbifold 
resolution, to have an example with larger h\\{= 19). In order to obtain some systematic results we 
considered exclusively line bundle gauge embedding on these geometries. 

Generic line bundle backgrounds can be characterized by a collection of h\\ sixteen-component 
bundle vectors V). Using this description the Bianchi identities without NS5-branes are quadratic 
Diophantine equations which are very hard to solve. For this reason we introduced Gram matrices, 
defined as Kij = V) • Vj , to systematically analyze the combined consequences of the Bianchi identities 
and the tree-level DUY equations. If one takes a point inside the Kahler cone as input, these equations 
can be interpreted as a linear system of 2 • hn equations of ^ /in (/in + 1) parameters. Not all such 
solutions can be used for model building as not all of them can be written as Gram matrices. For 
those that can, we generate corresponding sets of line bundle vectors for the three heterotic theories. 
Since the lattice on which the bundle vectors of the non-supersymmetric theory lie is a sublattice of 
both the EgxEg and SO(32) lattices, all bundle vector solutions of the SO(16)xSO(16) theory are 
automatically also solutions of the supersymmetric theories. 

We construct the full charged chiral spectrum of these models by exploiting the multiplicity op¬ 
erator in four dimensions. For each of the spectra we check all pure non-Abelian as well as mixed 
Abelian-gravitational and pure Abelian anomalies using the generalized Green-Schwarz mechanism 
with /in + 1 universal and non-universal axion couplings. This constitutes many consistency checks 
on the charged chiral spectrum. (For example, for h\\ = 5 and only two non-Abelian SU(Y) factors 
we already have more than 50 anomaly conditions.) 

In order to have a way of comparing the model building potential of the various heterotic theories 
on the various geometries, we have preformed computer-aided scans for all cases for a fixed period 
of time. The total model search duration seems to be arbitrarily chosen; indeed, as was pointed out 
in 1561, there does not seem to be a clear-cut bound on the line bundle input data. Consequently, we 
arbitrarily constrained the duration of scans to the same period of time for each heterotic theory on 
the various geometries under consideration. 

For all three heterotic string theories we have generated a large number of GUT-like models (up 
to over a few hundred thousand) which become (up to a few million) (MS)SM-like models upon 
using a freely acting Wilson line. We find that having three generation models does not seem to be 
especially singled out. Chiral exact models are not as abundant as models with additional vector-like 
SM-fermions. However, we were able to construct chiral exact non-supersymmetric SM-like models, in 
both the bosonic and fermionic sense, which even contain Higgs candidates in the form of vector-like 
five-plet pairs. 
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